21 research outputs found

    Heterogeneous biomedical database integration using a hybrid strategy: a p53 cancer research database.

    Get PDF
    Complex problems in life science research give rise to multidisciplinary collaboration, and hence, to the need for heterogeneous database integration. The tumor suppressor p53 is mutated in close to 50% of human cancers, and a small drug-like molecule with the ability to restore native function to cancerous p53 mutants is a long-held medical goal of cancer treatment. The Cancer Research DataBase (CRDB) was designed in support of a project to find such small molecules. As a cancer informatics project, the CRDB involved small molecule data, computational docking results, functional assays, and protein structure data. As an example of the hybrid strategy for data integration, it combined the mediation and data warehousing approaches. This paper uses the CRDB to illustrate the hybrid strategy as a viable approach to heterogeneous data integration in biomedicine, and provides a design method for those considering similar systems. More efficient data sharing implies increased productivity, and, hopefully, improved chances of success in cancer research. (Code and database schemas are freely downloadable, http://www.igb.uci.edu/research/research.html.)

    Do Acute Coronary Events Affect Lipid Management and Cholesterol Goal Attainment in Germany?

    Get PDF
    Objective To document utilization of lipid-lowering therapy, attainment of low-density lipoprotein cholesterol target values, and cardiovascular outcomes in patients hospitalized for acute coronary syndrome in Germany. Methods The Dyslipidemia International Study II was a multicenter, observational study of the prevalence of dyslipidemia and lipid target value attainment in patients surviving any acute coronary syndrome event. Among patients on lipid-lowering therapy for ā‰„3 months, use of lipid-lowering therapy and lipid profiles were assessed at admission and again at 120 Ā± 15 days after admission (the follow-up time point). Multivariate logistic regression was used to identify variables predictive of low-density lipoprotein cholesterol target value attainment in patients using lipid-lowering therapy. Results A total of 461 patients hospitalized for acute coronary syndrome were identified, 270 (58.6%) of whom were on lipid-lowering therapy at admission. Among patients on lipid-lowering therapy, 90.7% and 85.9% were receiving statin monotherapy at admission and follow-up, respectively. Mean (SD) lowdensity lipoprotein cholesterol levels in patients on lipid-lowering therapy were 101 (40) mg/dl and 95 (30) mg/dl at admission and follow-up, respectively. In patients with data at both admission and followup (n= 61), low-density lipoprotein cholesterol target value attainment rates were the same (19.7%) at both time points. Smoking was associated with a 77% lower likelihood of attaining the low-density lipoprotein cholesterol target value. Conclusion Hospitalization for an acute event does not greatly alter lipid management in acute coronary syndrome patients in Germany. Both lipid-lowering therapy doses and rates of low-density lipoprotein cholesterol target value attainment remained essentially the same several months after the event

    Genome wide screens in yeast to identify potential binding sites and target genes of DNA-binding proteins

    Get PDF
    Knowledge of all binding sites for transcriptional activators and repressors is essential for computationally aided identification of transcriptional networks. The techniques developed for defining the binding sites of transcription factors tend to be cumbersome and not adaptable to high throughput. We refined a versatile yeast strategy to rapidly and efficiently identify genomic targets of DNA-binding proteins. Yeast expressing a transcription factor is mated to yeast containing a library of genomic fragments cloned upstream of the reporter gene URA3. DNA fragments with target-binding sites are identified by growth of yeast clones in media lacking uracil. The experimental approach was validated with the tumor suppressor protein p53 and the forkhead protein FoxI1 using genomic libraries for zebrafish and mouse generated by shotgun cloning of short genomic fragments. Computational analysis of the genomic fragments recapitulated the published consensus-binding site for each protein. Identified fragments were mapped to identify the genomic context of each binding site. Our yeast screening strategy, combined with bioinformatics approaches, will allow both detailed and high-throughput characterization of transcription factors, scalable to the analysis of all putative DNA-binding proteins

    Heterogeneous Biomedical Database Integration Using a Hybrid Strategy: A p53 Cancer Research Database

    No full text
    Complex problems in life science research give rise to multidisciplinary collaboration, and hence, to the need for heterogeneous database integration. The tumor suppressor p53 is mutated in close to 50% of human cancers, and a small drug-like molecule with the ability to restore native function to cancerous p53 mutants is a long-held medical goal of cancer treatment. The Cancer Research DataBase (CRDB) was designed in support of a project to find such small molecules. As a cancer informatics project, the CRDB involved small molecule data, computational docking results, functional assays, and protein structure data. As an example of the hybrid strategy for data integration, it combined the mediation and data warehousing approaches. This paper uses the CRDB to illustrate the hybrid strategy as a viable approach to heterogeneous data integration in biomedicine, and provides a design method for those considering similar systems. More efficient data sharing implies increased productivity, and, hopefully, improved chances of success in cancer research. (Code and database schemas are freely downloadable, http://www.igb.uci.edu/research/research.html.

    Choosing where to look next in a mutation sequence space: Active Learning of informative p53 cancer rescue mutants.

    No full text
    MOTIVATION: Many biomedical projects would benefit from reducing the time and expense of in vitro experimentation by using computer models for in silico predictions. These models may help determine which expensive biological data are most useful to acquire next. Active Learning techniques for choosing the most informative data enable biologists and computer scientists to optimize experimental data choices for rapid discovery of biological function. To explore design choices that affect this desirable behavior, five novel and five existing Active Learning techniques, together with three control methods, were tested on 57 previously unknown p53 cancer rescue mutants for their ability to build classifiers that predict protein function. The best of these techniques, Maximum Curiosity, improved the baseline accuracy of 56-77%. This article shows that Active Learning is a useful tool for biomedical research, and provides a case study of interest to others facing similar discovery challenges
    corecore